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II.  ABSTRACT 

Given  n  observations  from  each  of  k  normal  populations  with  common  known 
2 

variance  o  ,  the  value  of  the  largest  of  the  k  means  (whose  values  arc  not  known) 
is  to  be  estimated  using  the  largest  value  of  the  k  sample  means.  It  is  desired  to 
design  a  sampling  rule  which  guarantees  that  the  Mean  Squared  Error  (M.S.E.)  of  the 
estimate  does  not  exceed  a  given  bound  regardless  of  the  configuration  of  values  of 
the  k  means. 

The  M.S.E.  is  a  function  of  A  =  (A,,..., A.  .)  where  A.  =  max  0.  -  0. 

"  i<jf_k 

(A^  is  zero  by  convention).  If  the  A's  are  known  a  smaller  sample  size  (fixed 
sample  size  procedures)  can  be  used  than  the  sample  size  n*  needed  to  guarantee  the 
M.S.E.  requirement  for  all  A.  Sequential  and  multi-sample  procedures  are  considered 
which  attempt  to  use  sample  information  about  A  to  reduce  sample  sizes.  Some  prop¬ 
erties  of  the  sample  size  function  and  M.S.E.  of  these  procedures  are  developed. 
Generally  it  is  found  that  sample  information  about  the  value  of  ~E  is  difficult 
to  use  efficiently. 
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1.  INTRODUCTION  AND  SUKHARY 

Given  k  normal  populations  with  means  0j,...,0k  and  common  variance 

2 

o  .  Based  on  n  observations  from  each  population,  a  point  estimate  of 

0*  *  max(9j, . . .  ,0^)  is  desired.  A  natural  estimator  to  use  would  be 

X*  ■  maxtXj , . . .  ,51^) ,  where  (1  <_  i  <_  k)  is  the  average  of  the  n  observa 

tions  from  the  ith  population.  The  risk  or  mean  squared  error  function 

R (X* ;  0j,...,e^)  is  easily  seen  to  be 

2  » 

(1.1)  R(X*;  01,...,0k)  -  E[(X*-e*)2|01,...,ek]  =  /  y2dn*(y  ♦  (/^A./o)) 

•  00 

2 

where  A^  =  0*  -  0^.  It  is  to  be  noted  that  even  if  a  is  known,  the  risk 

depends  on  the  unknown  nuisance  parameters  (Aj,...,Ak)  (one  of  which  is  zero) 

If  a  single-stage  experiment  is  to  be  designed  (n-chosen)  so  that 

R(X*;  0j,...,0k)  <,  r  (where  r  is  given),  then  an  upper  bound,  say  A*, 

must  be  used  for  the  integral  in  (1.1),  and  a  conservative  n  must  be  chosen 

2 

by  equating  r  to  (o  A*/n) .  On  the  other  hand,  if  multiple  stage  or  sequen¬ 
tial  procedures  can  be  used,  then  it  might  be  hoped  to  reduce  the  total  sample 

size  by  using  information  contained  in  the  earlier  stage  samples  regarding  the 
values  of  the  A^ .  It  is  procedures  of  this  type  which  will  be  investigated 
in  this  paper. 

Since  it  is  easy  to  see  that  (X*-X\ )  (1  <_  i  <_  k)  is  a  collection  of 

strongly  consistent  estimates  of  the  A^  (see  Dudewicz  [2]),  it  would  be 

hoped  that  when  the  desired  sample  size  (obtained  by  equating  (1.1)  to  r) 

is  large,  almost  the  same  sample  size  would  be  obtained  by  replacing  each  A^ 
in  (1.1)  by  its  estimate.  Certainly  one  would  hope  that  the  ratio  of  the  two 
sample  sizes  would  converge  almost  surely  to  unity.  As  will  be  seen,  this 


1 
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turns  out  to  be  the  case  if  the  desired  sample  size  is  made  large  by  allowing 
r  to  go  to  zero,  but  not  if  o2  is  allowed  to  become  infinite  while  r  is 
fixed.  It  will  be  seen  in  the  latter  case  that  the  ratio  mentioned  has  no 
almost  sure  limit.  It  does  have  a  limiting  distribution,  and  this  will  be 
examined.  Most  of  the  results  on  limiting  distributions  for  the  random  sample 
size,  and  for  the  resulting  risk  of  the  estimator  X*  based  on  random  sample 
size  have  been  obtained  only  for  the  case  of  two  populations.  Numerical  evalu¬ 
ations  of  the  limiting  expected  sample  size  and  mean  square  error  functions 
indicate  that  they  do  take  some  advantage  of  the  savings  possible  when  the 
A's  are  known,  and  may  be  of  practical  interest.  Some  simplified  two  sample 

procedures  are  examined  and  found  not  to  be  very  good. 

2 

Estimation  procedures  for  unknown  a  are  considered  in  [1]  in  which  a 
more  extensive  bibliography  is  given. 

2.  k  POPULATIONS,  CONVERGENCE  RESULTS 

It  is  easily  seen  that  the  supreraum  of  (1.1)  over  (Aj,...,Ak)  is 

2 

(o  /n)A*  where 

OD 

(2.1)  A*  *  A*  -  /  y2d*k(y) 

.00 

(see  Blumenthal  [1],  for  example).  The  conservative  approach  to  choosing  n 
so  that  (1.1)  is  at  most  r  (given),  would  be  to  take  n  *  n*  where 

(2.2)  n*  -  A*o2/r  . 

Clearly,  if  the  A^  were  known,  a  smaller  sample  size  could  be  used  by  equating 
(1.1)  to  r.  In  this  section  we  consider  how  much  could  be  saved  if  the  A^ 
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were  known,  and  we  examine  multi-sample  and  sequential  sampling  procedures 
which  try  to  accomplish  these  savings  by  using  estimates  of  the  A^  in  their 
stead.  Define  for  a  given  set 

(2.3)  n2  ■  (rn/A*o2) 

2 

so  that  (1/n  )  represents  the  "efficiency"  possible  if  the  A's  are  known, 
i.e.,  the  ratio  of  the  maximum  sample  size  to  the  actual  size  needed  for  a 
given  set  of  A's.  Also,  define  (assuming  that  the  one  A^  having  value  zero 
is  v 

(2.4)  x^  *  L^r\/k*//r  H  i  k-1,  x^  »  0 
and 

“  ,  k 

(2.5)  H(Xj .... ,xk  l)  *  (1/A*)  /  y\i  n  *(y*x.)  . 

_oo  i=l 

Then  the  result  of  equating  (1.1)  to  r  can  be  expressed  as 

(2.6)  n2  »  H(Xj, . . .  ,Xjc_j)  . 

Some  of  the  properties  of  H(x ^....x^)  are  examined  in  the  Appendix. 

From  Theorem  4.2.5  of  Dudewicz  [2],  it  follows  that 

(def . )  0  ?  k 

(2.7)  R*  =  inf  H(Xj . xfc.1)  >  (l/A*)((l/2)  ♦  /  yV(y))  >  (1/2  A*). 

xr",,xk-l 

2 

From  (2.6)  and  (2.7),  it  is  seen  that  inf  n  »  R*,  or  inf  n  »  R*n*.  Thus 
while  it  is  clear  that  savings  are  possible  when  the  A^  are  known,  the  per¬ 
centage  saving  is  limited  by  the  fact  that  R*  is  bounded  from  below. 
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A  reasonable  approach  to  taking  advantage  of  the  (limited)  savings  avail¬ 
able  would  be  after  taking  n  observations  to  compute  estimates  of  the 

2 

A.,  then  use  these  in  (2.4),  (2.5)  and  (2.6)  to  determine  n  ,  call  this  esti- 
2  2 

mate  n*,  compute  n  as  [n*n*]  where  fx"|  is  the  smallest  integer  not  less 
than  x  and  take  additional  observations  if  n  >  n,  and  stop  sampling  other¬ 
wise.  A  purely  sequential  procedure  would  repeat  this  operation  after  each 
stage  of  sampling,  a  multiple  stage  procedure  having  taken  n^  observations 
from  each  population  up  to  stage  j  and  computing  n^  based  on  these  would 
observe  at  the  next  stage:  max(0,n-nj)  additional  values  from  each  population. 
In  either  case,  it  can  be  seen  that  it  is  always  true  that  n  >  n^,  where 


(2.8) 


nQ  *  (R*A*a2/r)  >  (o2/2r)  . 


Thus  for  a  sequential  procedure,  an  initial  sample  size  of  nQ  must  be  taken 
before  applying  the  stopping  rule.  For  a  multi-stage  procedure  it  would  be 
inefficient  to  use  an  initial  sample  less  than  nQ  thereby  wasting  the  informa¬ 
tion  available  in  the  nQ  observations  which  must  be  taken.  Therefore  we 
assume  a  first  sample  of  n^  for  any  multi-stage  rule.  Define  the  sample 

size  at  stopping  as  N. 

2 

Let  o  be  defined  as 
Kn 

(2.9)  p2  *  (n/n*) 

2 

so  that  represents  the  ratio  of  the  sample  size  at  ‘topping  to  the  maximum 

sample  size  (ignoring  the  possibility  that  n*  is  not  an  integer).  For  two 
stage  sampling  where  N  =  fn*n  *1,  it  is  clear  that 


0  <  p 


2 

N 


(2.10) 


<  (1/n*)  . 


For  sequential  and  multi-stage  procedures,  it  is  possible  to  stop  with  N  >>  hN, 
2-2  -2 

so  that  oN  >>  (the  last  computed  value  of  nn).  For  multi-stage  procedures, 
it  is  clear  that 

(2.11)  pj  >  n* 

N  n0 

because  of  the  method  of  choosing  the  sample  sizes. 

Note  that  an  alternative  way  to  define  a  sequential  procedure  is  to  com¬ 
pute  an  estimate  Rn  of  the  risk  at  stage  n  based  on  substituting  ^  for 
in  (1.1),  and  to  stop  sampling  the  first  time  that  &n  <,  r.  If  R  is  a 
decreasing  function  of  n  for  fixed  (Aj, . . . .A^),  then  the  two  approaches 
are  equivalent.  It  would  be  surprising  if  R  were  not  decreasing  in  n,  but 
a  proof  exists  only  for  k  ■  2  (see  the  Appendix) .  The  alternative  approach 
to  multi-stage  procedures  is  to  compute  A.  based  on  the  observations 
obtained  through  the  jth  stage,  use  this  A  in  place  of  A  in  (1.1),  and 
solve  for  the  n,  say  n  which  makes  (1.1)  eqial  r.  At  stage  (j*l), 

(h-n^)  additional  observations  are  taken  if  n  >  n^ ,  otherwise  sampling  stops. 

The  definition  of  N  used  in  this  paper  assures  that  if  A.  „  =  A.  for 

i,n  l 

all  n,  then  the  resulting  n  will  identically  equal  the  result  of  equating 

(1.1)  to  r  with  the  correct  Ai  in  (1.1).  This  will  be  true  regardless  of 

whether  R  is  a  monotone  decreasing  function  of  n. 

2 

It  is  reasonable  to  ask  whether  pN  has  a  stochastic  limit  if  the  sample 
size  is  forced  to  be  large  by  making  the  desired  risk  r  small.  The  answer 
is  given  by  the  following 

Theorem  1.  Given  a  sampling  procedure  with  an  initial  sample  size  of  n0< 

Given  strongly  consistent  estimates  A^  n  of  the  A^ .  Let  all  A^  (1  <_  i  ^  k-1) 
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(2.12)  lim  pjj  -  lim  nJ  ■  (1/A*)  a.s. 

r-0  r*0 

(2.13)  lira  E(pJ)  -  lim  E(nJ)  =  (1/A*)  . 

r+0  ”  r+0  N 

Proof:  By  the  strong  consistency  of  the  A.  ,  for  given  e  >  0,  there  is 

l  ,n 

an  M  such  that 

(2.14)  PtL  n  >  (A^/2) ,  1  U  <  k,  all  n  >  M)  >  1-e  . 

From  (2.8),  it  is  seen  that  for  r  i  rQ  (say),  no  —  M  and  t*lus  def*ning 

x.  as  (A.  n/A*//r)  r  can  be  chosen  so  that  for  any  arbitrarily  large 

x  ^  n  x  p  n  x 

T  (recalling  that  n  >.  *'ft7  ) 

(2.15)  P{ x.  >  T,  1  <  i  <  k,  all  n  >,  n_)  >  1-e,  if  r  <  r.  . 

x  y  n  u  x 

Since  lim  H(Xj, . . . .x^^)  exists  as  all  the  xi  increase,  and  is  (1/A*), 
from  (2.15)  and  the  fact  that  T  is  arbitrary,  we  conclude  that  for  arbitrary 
6  >  0, 

(2.16)  P{  (1/A* ) - 6  <  njj  <  (1/A*),  all  n  >  nQ)  >  1-e,  if  r  <  ^  . 

a  2 

This  completes  the  proof  for  nN. 

2 

For  two  stage  procedures,  (2.10)  suffices  to  complete  the  proof  for  pn. 

«2 

For  multi-stage  and  sequential  procedures,  (nn  >  (l/A*)-5,  n  i  n^}  implies 
(with  (2.. 11)  and  the  preceding  discussion)  that  (pn  >,  (l/A*)-6).  For  multi¬ 
stage  procedures,  if  stopping  occurs  at  stage  j,  then  N  *  n^  j,  and 

*  fn*n^  Vn*  <  *  (1/n*)  if  (rj^  <,  (1/A*),  n  >  nQ}.  This  suffices 
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to  give  the  result  for  the  multi-stage  case.  For  the  sequential  case,  since 

stopping  occurs  at  the  first  n  >,  fn*n*]  so  that  (n*  <_  (1/A*),  n  ^n^},  it 
2 

follows  that  pn  4  (1/A*)  ♦  2/n*.  This  completes  the  proof  of  (2.12). 

The  proof  of  (2.13)  follows  from  the  dominated  convergence  theorem. 

If  the  A^  are  taken  as  (X*-X^)  as  mentioned  in  Section  1,  it  can  be 
seen  that  (2.12)  holds  as  a  -*■  »  provided  that  A.  *  A.  (o)  and  lim  A.(o)  *  ®. 

1  1  0-X»>  1 

Further,  note  that  Theorem  1  gives  a  very  strong  limitation  on  the  use¬ 
fulness  of  the  proposed  sequential  procedure.  When  the  Ai  (>  0)  are  fixed, 
this  procedure  takes  the  same  (order  of  magnitude)  number  of  observations  as 

would  be  the  case  if  the  A^  were  ail  infinite.  It  should  be  noted,  however, 

2 

that  ever,  if  the  A^  were  known,  as  r  0,  n  goes  to  (1/A*).  Thus  the 
sequential  (or  multiple  sample)  rule  docs  as  well  as  can  be  done  knowing  the 
A ' s .  If  k  »  2,  A*  *  1,  and  this  is  no  better  than  the  conservative  proce¬ 
dure.  For  k  ^  3,  some  savings  are  possible  relative  to  the  conservative 
procedure  since  A*  >  1,  The  situation  when  Aj  *  •  •  •  ■  A^  j  *  0  is  somewhat 
different  as  is  shown  below,  in  Theorem  2. 

An  additional  point  to  remark  here  is  that  N  for  the  procedures  being 

2 

considered  can  never  exceed  the  conservative  sample  size  (A*o  /r)  (i.e., 

2 

nN  4  1).  Thus,  N  has  both  lower  and  upper  limits. 

In  examining  (2.4),  (2.5)  and  (2.6)  it  is  seen  that  although  a  sizeable 

savings  in  sample  size  can  be  achieved  for  any  value  of  r,  the  particular 

A^  values  at  which  these  savings  are  achievable  depend  on  r.  In  particular, 

_  2 

suppose  (Xj, . . .  ,x^  j)  *  x  is  fixed.  Then  ry  can  be  computed  from  (2.6) 
and  the  a^  values  at  which  this  relative  sample  size  will  occur  are  given  by 


(2.17) 
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If  one  fixes  the  value  of  x  and  allows  the  A^  to  be  given  by  (2.17),  the 

question  is  whether  the  potential  savings  now  available  regardless  of  r  can 

be  achieved  in  any  sense  by  the  procedures  which  determine  the  sample  size  by 

using  the  estimates  A.  in  place  of  the  A..  The  following  theorem  gives  a 

1 1  n  x 

2 

negative  answer  in  the  sense  that  pn  does  not  converge  with  probability  one 
-2  2 

to  ny.  In  fact  pN  does  not  have  an  almost  sure  limit. 

Consider  the  natural  estimates  of  A^,  namely  (X*-X^).  Note  that 

(/n/o) (X*-X^)  is  distributed  as  (Y*-Y^)  where  the  Y^'s  are  normal  with 

2 

means  (-*^A^/a),  and  unit  variance.  If  n  •  c(A*o  /r)  (for  any  c  >  0)  and 

A^  is  given  by  (2.17),  these  means  are  (<^cx^/nx),  independently  of  r.  For 

n  of  this  form,  the  distribution  of  (i/n/o)A^  n  (given  that  A^  n  >  0,  i.e., 

that  we  are  not  looking  at  the  one  value  of  A.  which  by  definition  is  zero) 

l ,  n 

will  assign  positive  weight  to  all  intervals  on  the  positive  axis,  and  the 

weight  will  be  independent  of  r.  Further,  the  conditional  joint  distribution 

of  the  (k-1)  positive  values  of  (/n/o)A.  (given  that  they  are  positive) 

i  ,n 

assigns  positive  weight  to  all  measurable  (k-1)  dimensional  sets  having  posi¬ 
tive  Lebesgue  measure,  and  the  weight  is  independent  of  r.  Below  we  restrict 
attention  to  estimates  A.  having  this  property,  which  we  refer  to  as 

i 

property  P . 

Theorem  2.  Let  x  be  fixed  and  the  A-  be  given  by  (2.17).  Let  the  esti- 

2 

mates  a^  n  have  property  P  defined  in  the  preceding  paragraph.  Let  n- 

2  2 

be  given  by  (2.6)  and  nN  be  as  in  Theorem  1,  and  pN  be  given  by  (2.9). 

—  2 

Then  for  any  x  such  that  R*  <  ry  <  1 ,  there  is  an  e  >  0  such  that 

lira  P{j(pS/nl)-i|  >  c)  >  0  . 
r+O 


(2.18) 


r 


Proof:  Let  6  <  1  be  sufficiently  large  so  that  H(Xj,...,x^  j)  is  a 

decreasing  function  of  each  of  its  arguments  on  the  set 

-  {(x1,...,xkl):  4  <  H(Xj . <  1;  0  i  xi  I  xi(6)»  i  -  1 , . .  ,k-l } 

where  x”(6)  is  the  smaller  of  the  two  solutions  to 


I 

l 

1 

I 

I 


inf  H(x. , . . .  ,x.  ,)  *  6 

1  K‘1 

(if  there  are  two  solutions,  i.e.,  if  6  <  (1/A*)).  Letting  Ai(6)  ■  (x”(6)//6), 

it  can  be  seen  from  the  monotonicity  of  x’(5)//fi  (see  Remark  2,  Appendix)  that 

2 

the  set  of  (A^, . . .  .A^^)  such  that  {6  5_  n  ^  1)  contains  the  set 

Tj  ■  (/r/A*  Sfi//6)  (i.e.,  each  point  in  is  multiplied  by  the  given  factor). 

2 

Let  n  ■  (cA*0  /r)  (c  >  0)  and  the  A's  be  given  by  (2.17).  Let 

A  »  (£  .....A.  .  )  (assume  A.  ■  0  for  notational  convenience) 

n  x }  ii  |  n  k  j  n 

(nj  >  &)  3  {(/n/o)A  e  (^/o)Tfi}  -  { (/S/o) A  e  JZ Sg/  /fi)  . 

From  property  P  and  the  fact  that  /cSfi//6  is  measurable  with  positive 
Lebesgue  measure,  it  follows  that 

(2.19)  P(n2  >  5)  >  0  . 

n 

If  n  is  taken  to  be  n^,  then  c  ■  R*  and  using  (2.11)  along  with 

(2.19)  we  see  that  for  multi-stage  procedures  (two  or  more  stages), 

(2.20)  P(pJ  >  6)  >  P(n2  >  6}  >  0  . 

N  n„  “ 


l 
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_  2 
If  x  is  given,  6  can  be  taken  as  (l*e)ny,  where  e  is  sufficiently  large 

to  satisfy  the  restriction  on  6.  This  proves  (2.18)  for  multi-stage  procedures. 

Consider  now  the  sequential  procedure.  At  stage  n,  it  stops  if  n  ^  n  , 

and  since  it  can  stop  prior  to  reaching  stage  n, 


(2.21)  P[N  <  n]  >  P[n  >  nn]  . 

Let  n  ■  fn*6*| .  Now  n  ■  fn*n  1.  Noting  that  (n  <  Al  (n  >  n  ),  and  that 

n  n  n  ™  n 

(N  fn*6]}  (p*  ^  6*(l/n*)},  we  can  conclude  using  (2.21)  that 

(2.22)  P<pjJ  <  «+(l/n*)}  >  P( njj  <  6),  when  n  •  fn*6]  . 


In  this  case  let  6  >  R*  and  sufficiently  small  so  that  we  can  choose 

the  set  S&  such  that  it  is  connected,  and  on  Sfi,  R*  <_  H(Xj,...,xk  j)  <  6 

and  H(Xj, ..  ..x^j)  is  decreasing  in  all  of  its  arguments.  Clearly  Sfi  has 

positive  Lebesgue  measure  and  because  of  its  definition,  the  set  (Aj,...,^  ^ 

such  that  (R*  <  n  <  6)  contains  T.  ■  (/r/A*  U.)  and  U.  is  obtained  from 

—  —  6  0  0 

2 

Sfi  by  taking  each  (Xj , . . .  .x^)  on  the  contour  H(Xj,..  ..x^j)  ■  n  , 

2 

(R*  <_  n  ^6)  and  transforming  it  to  (Xj/n, . ..jX^  j/n).  The  monotonicity 
of  this  transform  on  S.  follows  from  Remark  2  in  the  Appendix.  As  before, 
with  c  ■  6,  it  follows  that 


(2.23)  P(f^  <  6}  >  P((/n/o)A  e  ✓T U^)  >  0,  when  n  -  An*  . 

2 

Letting  6  ■  (l-c)ny  where  e  >  0  is  chosen  to  satisfy  the  above  requirements, 
(2.22)  and  (2.23)  suffice  to  demonstrate  (2.17)  for  the  sequential  case,  com¬ 
pleting  the  proof  of  the  theorem. 
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Remarks: 

1)  The  proof  of  Theorem  2  suggests  that  although  N  and  pN  do  not 
have  a  probability  limit,  they  should  have  a  distribution  which  may  have  a 
limit  as  r  -*•  0.  For  sequential  and  multi-stage  procedures,  the  distribution 
would  be  very  difficult  to  obtain  analytically,  but  for  two  stage  procedures 

it  should  be  relatively  simple.  In  view  of  (2.10)  the  limiting  distribution 

2  -2 
of  Pj.  (as  n*  -*•  ")  will  be  the  same  as  the  distribution  of  n  which  is 
n  nQ 

independent  of  n*.  Some  aspects  of  this  distribution  will  be  studied  for 
k  ■  2  in  the  next  section. 

2)  Setting  x  »  (0,...,0)  shows  that  the  result  of  Theorem  1  does  not 

hold  when  all  A^'s  are  zero,  and  by  remark  (1)  above,  it  can  be  expected  that 
2 

PN  has  a  limiting  distribution  in  this  case.  In  fact,  when  any  subset  of  p 

2 

of  the  A's  are  zero  while  the  remainder  are  fixed,  the  resulting  pN  will 
have  a  limit  distribution  found  by  studying  the  distribution  of 

H(Xj , . . . , Xp,®, . . . ,*) . 

5)  Large  sample  sizes  are  needed  either  if  r  is  small  as  above  or  if 

2  2 
o  is  large.  The  behavior  of  N  and  pN  in  the  latter  case  can  be  studied 

2  2 

by  taking  limits  as  a  -*■  ®  (o  known).  In  this  case  for  fixed 

2  2 

A  ■  (Aj,...,A^  j),  and  fixed  r  it  is  clear  that  n  (■  n^)  is  given  as 
the  solution  of  (2.4)  and  (2.5)  and  n  is  obtainable  using  (2.3).  The  value 

2  2 

of  (n/o  )  is  thus  independent  of  a  .  For  given  A.  (1  <  i  <  k),  it  is 

i,n 

2  2 
also  clear  that  (n/o  )  has  a  value  which  is  independent  of  o  and  which 

depends  truly  on  the  value  of  A^  .  In  this  case  it  is  very  reasonable  to 
2  2* 

ask  whether  p^  converges  to  when  the  A^  converge  almost  surely  to 

2 

the  Ai  (for  fixed  o  ,  as  n  increases). 
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For  the  rules  under  consideration,  sample  sizes  are  of  the  form 

n  ■  co^  (c  >  0)  so  that  the  nonzero  values  of  {(*^n/o)&,  ,  1  <_  i  <  k } 

i  »  n  — 

2 

have  a  conditional  joint  distribution  independent  of  o  which  puts  positive 
mass  on  any  measurable  set  having  positive  Lebesgue  measure.  The  method  of 
proof  of  Theorem  2  gives  as  an  immediate  corollary,  the  following 

Corollary  1.  Let  A  *  (Aj, . ...A^  j)  be  fixed.  Let  the  distributional  prop- 

2  2 
erty  P  hold  when  n  *  co  .  Then  for  any  A,  such  that  R*  <  n.  <  1,  there 

A 

is  an  e  >  0  such  that 

(2.24)  lim  P{|(pJ/nJ)-l|  >  c)  >  0  . 

-2 

It  is  clear  in  fact  that  n  will  have  a  distribution  which  does  not 
2  0 

depend  on  o  ,  and  for  the  two  sample  case  the  limiting  process  is  needed 

2 

only  to  assure  sufficiently  large  n  so  that  the  discrete  variable  pn  will 

2 

be  close  to  the  continuous  n^. 

It  should  be  noted  that  the  problem  considered  in  this  section  differs 

2 

in  an  important  way  from  the  case  in  which  o  is  unknown,  and  the  A^  are 
eliminated  from  consideration  by  some  means  (e.g.,  see  Blumenthal  [1]),  and 
from  the  usual  situations  in  which  sequential  or  multi-stage  estimation  proce¬ 
dures  are  used.  In  these  other  cases,  the  desired  sample  size  is  an  unbounded 
function  of  the  unknown  nuisance  parameter,  and  consequently  two-stage  proce¬ 
dures  tend  to  be  inefficient  relative  to  sequential  procedures  due  to  the 
possibility  that  the  initial  sample  size  may  be  far  too  small.  In  the  problem 
considered  here,  the  desired  sample  size  is  a  bounded  function  of  the  nuisance 
parameter  (Aj.-.-.A^  j),  and  in  fact  is  bounded  below  as  well  as  above. 

Thus  it  is  possible  to  choose  an  initial  sample  size  for  a  two  stage  procedure 
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such  that  no  sequential  procedure  could  stop  with  fewer  observations,  and  such 
that  this  initial  sample  represents  a  sizeable  fraction  of  the  ultimate  sa.aple 
size  (with  a  value  of  about  0.80  when  k  ■  2  and  a  limit  of  (1/2)  as  k 
increases  to  infinity).  Thus  the  information  available  to  determine  the  value 
of  N  is  almost  as  great  for  the  two  sample  procedure  as  for  a  sequential  one, 
and  little  would  appear  to  be  gained  by  using  a  complex  sequential  stopping 
rule  in  place  of  the  simple  two  sample  one.  This  conclusion  is  supported  by 
Theorems  1  and  2  which  show  both  procedures  to  behave  similarly  in  the  limit. 


3.  TWO  POPULATIONS;  DISTRIBUTIONAL  RESULTS  FOR  TWO  STAGE  SAILING 

In  this  section,  the  distribution  of  the  sample  size  and  risk  function 

2 

will  be  studied  for  the  two  stage  sampling  procedure  when  either  a  is  large 
and  the  A^  are  fixed,  or  r  is  small  and  the  A^  are  proportional  to  /r. 
The  discussion  at  the  end  of  section  2  is  taken  as  the  rationale  for  not 
attempting  the  extremely  difficult  task  of  studying  these  for  sequential  or 
multi-stage  procedures.  Theorem  2  and  Corollary  1  provide  the  incentive  to 
discover  just  what  the  sample  sice  and  risk  behavior  is  in  these  cases  for 
which  we  do  not  have  stochastic  convergence  of  any  sort. 

Although  it  should  be  possible  to  characterize  the  risk  and  sample  size 
distribution  for  any  k,  the  analysis  is  greatly  simplified  when  k  »  2 
(especially  for  the  risk  function)  and  we  shall  now  specialize  to  the  two 
population  case.  Some  specialized  notation  will  help  in  this  study. 

Let 


(3.1) 


(X.  *X  )/2; 
i,n  z,n 


<Xl,n-X2.n>/2' 


(e1^e2)/2;  V  -  (01-e2)/2;  U  -  |v|  . 


(3.2) 


a  ■ 
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Then 

(3.3) 

X*  ■  Zn  +  Z  ,  6*  ■  a  ♦  w 

n  0,n  n 

and  ZQ  is  normal. 

2 

mean  a,  variance  (o  /2n)  while  Z^  n 

is  normal  mean 

2 

v,  variance  (o  /2n) 

and  Z.  is  independent  of  Z,  ,  hence 

u,n  l,n 

of 

Z  .  Note 
n 

that  a)  corresponds  to 

(6j/2)  in  the  general  notation  and  Z^ 

to 

The  risk  (1.1)  specializes  to 


(3.4)  (o2/n)(l  ♦  2F (x) ) 
where 

(3.5)  F(x)  -  x2*(-x)  -  x*(x) 
and 

(3.6)  x  ■  ui/Jn/o  ■  nu/(2/r) 
with 

(3.7)  n2  ■  m/o2  (since  A*  -  1)  . 

The  function  F(x)  is  zero  at  x  ■  0,  decreases  to  (-0.1012)  at  x*  *  0.6120, 
where  x*  is  the  unique  solution  of  $(x)  ■  2x4 (-x),  then  rises  again  to  zero, 
so  that 

(3.8)  R*  «  inf (1  ♦  2F (x) )  -  1  -  x*4(x*)  »  0.7976  . 

Corresponding  to  (2.6),  we  have 


(3.9) 


n2  ■  (1  ♦  2F(x))  . 


IS 


2  . 2 

Based  on  an  initial  sample  of  n^  «  R*n*  observations  (with  n*  ■  o  /r),  n 
is  found  from 

(3.10)  n2  -  1  ♦  2F(nZ  /fir)  . 

n0 

2 

Then,  N  ■  fn*n ] ,  and  an  additional  (N-n^)  observations  are  taken.  The 
-2 

distribution  of  n  is  easily  found.  Re-write  (3.6)  as 

(3.11)  x  »  nu)/2n^/o/fF 
and 


(3.12)  x  -  nZ  /2nZ/o/R*  . 

n0  ° 

Further,  for  R*  <  6  <  1 

(3.13)  (n2  <  6)  <-»  ( (x"  (6)/ /6)SR*  <_  l  /2iT/o  <  (x*  (6)//6)^} 

n0 

where  x*(6)  <  x*(6)  are  the  two  solutions  of  6  ■  1  ♦  2F(x).  Note  that 

Z  /2nn/o  is  |W|  where  W  is  Normal,  mean  (/2nft/o)u>,  and  variance  1. 
n0  u  u 

Henceforth,  assume  that 


(3.14) 


A  convenient  parametrization  is  obtained  by  letting 

(3.15)  B  •  x/R*/(l  ♦  2F(x))  . 

This  choice  of  B  gives  u  such  that  the  solution  of  n2  ■  1  +  2F(nu>/2n0/o/R*) 
2 

is  simply,  n  ■  1  ♦  2F(x).  Note  that  this  choice  makes  u>  proportional  to 
Jr ,  so  that  if  limits  are  taken  as  r  ■*  0,  then  u>  is  varying  with  r  as 
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2 

in  Theorem  2,  and  if  r  is  fixed  and  a  ■*  ®,  then  w  is  fixed  as  in  Corollary 
1.  Combining  the  above,  when  (3.14)  holds, 


(3.16) 


P(n2  <  <5)  ■  ♦[(x*^V/6)-8)  -  4((x'^V/6)-6] 

♦  4(-(x"^V/6)-e)  -  *[-(x*/RV/6)-e]  . 


2 

From  (2.10),  we  see  that  lim  P(pN  <  6}  is  given  by  (3.16).  It  is  easily  seen 

n*-»« 

from  the  bounded  convergence  theorem  and  (2.10)  that  all  moments  (positive  and 

2  2 
negative)  of  pn  converge  to  the  corresponding  moments  of  n  ,  as  n*  ■*  <*>. 

It  is  not  especially  convenient  to  use  the  distribution  (3.16)  to  find 
_  2 

the  moments  of  n  since  it  involves  inverting  (1  ♦  2F(x))  to  obtain  x 

and  x+.  Let  Y,  be  a  standard  normal  variable.  We  can  write  (/2n.  Z  /a) 

1  0  no 

2  2  u 

as  |Yj  ■>  ej  (6  given  in  (3.14)),  and  then  n  *  n  (| Yj  ♦  b|)  where 

•t 

rf  (x)  is  the  solution  of 

(3.17)  n2  -  1  ♦  2F(nx/*/ft*)  . 


Then 

OD  OD 

E{(n2)p)  •  /  (n2  |  y>6  |  )p  <Ky)dy  ■  /  (n2  (-y-f))p4>(y)dy  ♦  /  (n2 (y*B) )P4» {y )dy 

-«•  -•  -B 

(3.18) 

OD 

■  /  (n2(u))p[#(u*6)  ♦  ♦(u-B)Jdu  . 

0 

2 

This  integral  still  involves  the  implicitly  defined  function  n  (u).  Now,  let 

(3.19)  v  *  nu//?T*,  or  u  *  /R*v/r\ 


so  that 


(3.20) 


n2  -  1  ♦  2F (v) 
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and 


(3.21) 


(^Vn2)(n  -  -  (^/n3)(l  ♦  2F(v)  -  vF'(v)) 


(*fiVn3)(l  -  v*(v))  . 


Using  this  change  of  variables,  (3.18)  becomes 


(3.22) 


E{(n2)P)  -  /R*  f  n(2p"3)(v)(l  -  v^(v))[*(*fiMv/n(v))*6) 
0 

♦  6(»1F(v/n(v))-6)]dv 


where  n(v)  is  now  given  explicitly  by  (3.20).  For  notational  convenience, 
define 


(3.23) 


y(v)  -  ♦✓R*/(l  ♦  2F  (v) )  . 


Then,  (3.22)  becomes 


P  /  °° 

(3.24)  E{  (n2)  }  -  0*)(P_1)/  ^Wll-  v4(v))[4(vY(v)>8)*4(vY(v)-6)]dv  . 

0 

Next  the  aoaents  of  (X*-6*)  will  be  studied,  from  which  the  risk  can 
be  derived.  Using  (3.3), 

(J.2S)  (Xj5-9*)P  •  ♦  <V>JP  ■  jo  (?)tZ0.N-<,)i(ZN-“)P'i  ' 


Also,  conditioning  on  N  ■  n,  and  noting  that  the  stopping  rule  is  based 

on  Z  and  that  Z.  is  independent  of  Z  ,  hence  independent  of  N,  it 
n0  °»n0  n0 
it  seen  that 


(3.26)  E(t(Z0  n-o)i(Zn-w)P'il|N-n}  •  (o/^n)iE(Wi)E((Zi|-«;r'*|N-nJ 
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hence 

(3.27)  E<(20 ,N-°,)i(ZN-“)P’i)  "  E(Wi)(o//2)iE((Zn-w)P‘i  M‘i/2] 

where  W  is  a  standard  normal  variable.  Next,  note  that  Z,  ,,  can  be 

1  ,N 

rewritten  as 


Jrtn.Z,  ♦  nZ,  -v] 

Nl  0  l,nQ  l,nJ 

where  n  -  N-n.  and  nZ.  'v  is  the  sum  of  n  normal  random  variables,  with 
0  l,n 

2 

mean  u>  and  variance  a  / 2 ,  which  are  independent  of  Z.  and  of  one 

1,n0 

another.  Hence,  we  shall  write 


(3.28) 


Z1,N  ‘  U/Nm^o/^Yj  ♦  (A  o//2)Y2  ♦  No.] 


where  Yj  and  Y2  are  independent  standard  normal  variables.  Using  (2.9)  and 
(3.14),  we  obtain 

(3.29)  Z1N  -  (r/2R‘)1/2{(R*/pJ)Y1  ♦  {(R*/pJ)d  -  (RVpjJ))] 1/2  Y2  ♦  8}  . 

-2 

Note  that  n  is  given  in  (3.17)  as  a  function  of  Yj.  Using  (3.29),  (2.10) 
and  the  fact  that  R*  <_  (P*/p^)  <  1,  the  dominated  convergence  theorem  yields 
for  any  j, 


(3.30)  lim  E((Z J^r)h  -  (2R*)’j/2E(|(R*/n2)Y  ♦  [(R*/n2)(l  -  R*/n2))]1/2Y  ♦  B|j 
n*-w» 

Let  y(Yj)  ■  y(|Yj  ♦  8|)  be  given  by  (3.23),  (i.e.,  y2  ■  (R*/n2)),  and  con¬ 


sider 
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E{|y2(Y1)Y1  ♦  [Y2(Y1)(1  -  y2(Yj))]1/2  y2  4  6|) 

■  EE(|Y2(y)y  ♦  [Y2(y)(l  -  Y2(y))]1/2  Y2 ♦  B| |Yj  -  y } 

(3.31)  -CA/B)  - 

■  E{  /  (-A-Bu)#(u)du  ♦  /  (A+Bu)<J>  (u)du) 

-  CA/B) 

-  E{A(1  -  2*(-A/B))  ♦  2B$ (A/B) } 

where 

(3.32)  A  -  Y1y2(Y1)  *8,  B  -  [y2(Yj) (1  -  y2(Y1))]1/2  . 
Also, 

E(|y2(Y1)Y1  ♦  (y2(Y1)(1  -  Y2(Y1))]1/2Y2  ♦  B  | 2) 

(3.33) 

-  EE(A2  ♦  2ABY2  ♦  B2Y2 1 Yj }  -  E{A2  ♦  B2}  . 


Combining  (2.8),  (2.9),  (3.14),  (3.25),  (3.27),  (3.30),  (3.31),  (3.32),  and 

(3.33)  we  obtain 

lim(l/r)E(X*-0*)2  -  (1/2R*)E(y|Y4(Y  )  ♦  [Y2(Yj) (1  -  y2(Yj))] 

n*-H» 

(3.34)  , 

♦  4B(A4(-A/B)  -  B$(A/B)]  ♦  Y  (Yj))  . 


2 

Note  that  the  last  Y  (Y^)  term  in  (3.34)  represents  the  contribution  of 
ZQ  N  to  the  risk,  while  the  rest  represents  the  contribution  of  ZN.  To 
evaluate  the  expectations  in  (3.34),  it  is  necessary  first  to  break  the 
integrals  into  the  two  components  Y^  ♦  B  <  (>)  0  as  in  (3.18).  Some 
simplification  is  then  possible  by  partial  integrations,  e.g.. 


20 


00  0D 

/  (y-8)2Y4(y)$(y-e)dy  -  -8y4(0)$(6)  ♦  /  [y4 (y)  ♦  4(y-e)Y3(y)ir  y (y) )«*> (y-e)dy  . 
o  o  y 

4 

In  this  way,  the  y  (Y^)  term  can  be  eliminated.  Lastly,  the  change  of  vari¬ 
ables  (3.19)  can  be  used  to  get  all  functions  defined  explicitly  (when  this  is 
done,  the  (dy(y)/dy)  factor  is  absorbed  into  the  differential  for  the  new 
variable) . 

Using  (3.30)  and  (3.31)  it  is  also  possible  to  evaluate  the  bias  of  the 
estimator  X*  which  by  (3.17)  and  (3.27)  is  just  E(ZN-u).  We  summarize  the 
results  as  follows: 


(3.35a) 

(3.35b) 

(3.35c) 

(3. 3Sd) 

(3. 35e) 


limVf  E(X*-0*)  -  (/2/R#3/2)[I4  -  I3] 
n*-n» 

lim  ^E(Z0N-a)2  -  (1/2R*2)I2 
n*-*» 

lim  i-E(ZN-a,)2  -  (1/2R*2) (4Ij  ♦  I2  ♦  4bI3) 
n*-*«® 

lim  Ie(X*-0*)2  »  (1/R*2)[2I1  ♦  I2  ♦  20I3] 


lim(l/n*)E(N)  «  I$ 
n*-H» 


where 


(3.36a)  I  «  /  [4> (v)  -  2v*(-v)]y6(v)[C(v)*(C(v))  ♦  D(v)*(D(v))]dv 
0 

00 

(3.36b)  I  «  /  y5(v)[1  -  v<Kv)3 (♦(C(v))  ♦  *(D(v))]dv 

0 

00 

I,  -  /  ((Y2(v)C(v)>6)*(-(Y2(v)C(v)*B)/B(v))<KC(v)) 

3  0 

(3.36c)  -(Y2(v)D(v)-6)^((Y2(v)D(v)-e)/B(v))^(D(v)) 

-2B(v)4.(vy2(v)/B(v))4>(6/y(v))}y3(v)[1  -  v$(v)Jdv 
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(3.36d) 

(3. 36e) 

and 


(3.37) 


I,  -  /  £<.(v)  -  2v*(-v)]y4(v)U(C(v))  -  4>(D(v))]dv 
0 

00 

U  -  /  Y(v)[l  -  v*(v)][<HC(v))  ♦  4>(D(v))]dv 
3  0 

B2(v)  -  y2(v)(l  -  y2(v)) 

C(v)  »  vy(v)  -  6 
D(v)  ■  vy(v)  ♦  b, 


with  Y(v)  given  by  (3.23). 

One  question  left  unanswered  is  the  following.  Theorem  1  gives  the 
2 

almost  sure  limit  of  pn  as  r  -*■  0  when  the  ^  are  fixed  and  positive.  How 
2 

does  E(X^-e*)  behave  in  that  case?  The  preceding  development  can  be  used  to 
answer  the  question  for  k  =  2  and  the  two  sample  procedure.  The  development 
of  (3.34)  can  be  used  to  write  the  exact  expression 

(l/r)E(XjJj-0*)2  =  (1/2R*)E(Y2YJ(Y1)  ♦  [y^M  -  Y^))] 

(3'38)  ♦  4B[A*(-A/B)  -  B*(A/B)J  +  yJcYj)} 

2  2  2  2 
where  Yn(yj)  *  R*/pn>  and  where  yn  is  used  for  y  in  A  and  B.  The 

?  . _ 

pointwise  limit  of  Yn(Yj)  as  B  ®  (recall  8  =  /2R*/r  u>,  ui  fixed)  is 

R*  and  dominated  convergence  allows  immediately  taking  limits  under  the 
expectation  for  all  terms  except  4B[A$(-A/B)  -  B4> (A/B) ] .  This  term  has  a 
pointwise  limit  of  zero  as  B  increases,  as  is  easily  seen  from  the  standard 
tail  approximation  to  the  normal  distribution,  and  it  is  easy  to  verify  that 
the  expectation  has  a  limit  of  zero.  Thus, 
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(3.39)  lim  E(X*-0*)2  -  lim  (3.35d)  -  1  . 

r-K)  B-*» 

In  general  (k  populations,  sequential  or  multiple  sampling),  it  is  conjectured 
that  the  risk  when  the  are  fixed  and  r  -*•  0  will  be  the  same  as  if  a  non- 

random  number  n  ■  (n*/A*)  of  observations  were  taken.  This  would  give 

00 

lim  E(l/r) (X*-e*)2  -  lim  /  y2dn*(y  ♦  ■  1  . 

r+0  n  r-*0  -»  1 

Since  the  integrals  in  (3.36)  are  quite  complex,  it  is  very  difficult  to 

tell  from  looking  at  them  how  the  expected  sample  size  and  risk  behave,  except 

2 

in  the  limiting  case,  where  B  -*■  ».  In  that  case,  n  (|Y  ♦  b|)  approaches 

unity  for  all  Y  and  dominated  convergence  allows  the  conclusion  that 

lim  H{ r»2 ( | Y  ♦  B|)}  •  lim  E(n~2(|Y  ♦  $()}  ■  1  so  that  (3.35b)  approaches  (1/2) 

and  (3.35e)  approaches  unity  as  6  increases.  We  have  seen  already  that 
(3.35d)  approaches  unity  and  the  same  argument  shows  that  (3.35a)  approaches 
zero  and  (3.35c)  approaches  (1/2).  For  finite  B,  since  n  ( | Y  ♦  B|)  <  1, 
it  can  be  concluded  that  (3.35b)  will  exceed  (1/2)  and  that  (3.35e)  will  be 
less  than  unity.  Otherwise,  the  behavior  is  not  ascertainable  from  examination 
of  the  formulas.  Therefore  numerical  integrations  have  been  performed,  and  the 
results  are  given  in  the  following  section. 


4.  NUMERICAL  RESULTS 

The  performance  of  the  two  sample  procedure  as  given  by  (3.35)  was 
evaluated  numerically  for  a  range  of  values  of  u>.  If  u  were  known  then 
n  could  be  chosen  so  that  (3.4)  equals  r  (given)  and  the  value  n^  defined 
in  this  way  can  be  considered  as  an  ideal  value  under  perfect  information.  To 


▼ 


l 
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evaluate  the  effectiveness  of  the  two  sample  procedure,  E(N)  should  be  com- 

1  2 

pared  to  n^,  and  pE(X*-0*)  should  be  compared  to  unity  which  is  the  value 
1  2 

of  - E(X*  -9*)  .  Similarly  the  biases  of  the  ideal  and  two  sample  procedures 

w 

can  be  compared.  A  convenient  reparametrization  is  the  following.  Let  x  be 
a  running  parameter  and  define 


(4.1) 


mx  ■  a  (1  ♦  2P(x)),  and 


oj  ■  x//2n 
x  x 


Clearly  nx  is  n^  .  The  operating  characteristics  of  the  two  sample  procedure 

x 

have  been  computed  for  oj  ■  u>x,  as  x  traverses  a  suitable  range. 

Table  1  gives  for  each  x,  Ax  where  >x  ■  ^✓n*’  and  n*  *  cj  / r.  This 

is  followed  by  n°  ■  nx/n*  and  Nx  ■  lim(E(N)/n*)  (as  given  by  (3.35)).  Next 

n*-*» 

—  1  2 

is  Mx  o  lim  —  E(X*-e*)  ,  and  its  two  constituent  components  fTQx  ■ 

n*x»  _ 

lim  iE(Z0N-a)2  and  MJx  -  lim  j;E(Zn-u>)2.  After  this  is  Bj  -  -/fE(X*  >6*), 
n*-*»  n*-*®  x 

the  normalized  bias  of  the  ideal  or  perfect  information  procedure  (Bj  ■ 

-2AxF(x)/x2),  and  Bx  -  lim  -^^E(XjJ-0*) .  Finally  for  reference,  the  norma - 

n*-*» 

lized  mean  squared  error  (M  )  and  bias  (B  )  of  the  conservative  single 

c  c 

sample  procedure  whose  sample  size  is  n*  are  given  (evaluated  at  w  *  u>x,  so 
that  Mc  «  1  ♦  2F(/2  Ax),  and  Bc  «  -F(A  XXJ/AX) • 

Comparison  of  E(N)  with  nx  shows  that  E(N)  tends  to  be  flatter  as 
a  function  of  x,  lying  below  nx  at  extreme  values  and  above  in  the  central 
region.  The  M.S.E.  of  X*  is  not  quite  as  flat  as  might  have  been  hoped. 

Note  however  that  it  is  below  unity  at  x  >  0.20  and  at  x  *  1.50  even  though 
E(N)  <  nx,  indicating  that  the  two  sample  procedure  may  be  somewhat  more  effec¬ 
tive  in  using  the  observations  taken  than  is  the  one  sample  procedure.  Consider 
the  M.S.E.  of  a  single  sample  procedure  whose  sample  size  n  equals  E(N)  of 


4 


the  two  sample  procedure.  Table  2  gives  a  few  selected  values. 


Table  2 


X 

(n/n*) 

n*E(X*  -  9*)2 

M 

X 

.00 

.867 

1.151 

1.103 

.15 

.867 

1.040 

1.004 

1.00 

.890 

0.942 

0.928 

1.55 

.914 

1.005 

1.002 

2.00 

.936 

1.025 

1.046 

2.50 

.958 

1.030 

1.064 

4.00 

.995 

1.005 

1.022 

It  is  clear  that  for  small  x,  the  two  sample  procedure  uses  its  observations 
more  effectively  than  does  a  comparable  single  sample  one,  but  for  larger  x 
values  the  M.S.E.  of  Xj^  decreases  more  slowly  toward  its  asymptote  of  unity 
and  gives  slightly  higher  M.S.E.  values  than  the  comparable  single  sample 
estimate. 

A  similar  behavior  in  the  bias  of  the  two  sample  procedure  relative  to 
Bj  is  also  noted. 

It  was  demonstrated  that  (N/n^)  does  not  converge  stochastically  to 
unity  as  n*  increases,  and  the  numerical  results  show  that  (E(N)/nx)  is 
not  too  close  to  unity  either.  Table  1  also  shows  that  MSE(Xj^)  fails  to 
achieve  the  goal  of  being  constant  at  unity.  In  spite  of  these  facts,  it  is 
seen  that  the  two  sample  procedure  does  take  some  advantage  of  the  possible 
savings  available  when  u  is  known,  and  the  M.S.E.  curve  does  not  rise  very 
far  above  unity.  Compared  to  the  conservative  procedure,  about  a  10%  saving 
in  sample  size  is  achievable  for  hoderate  values  of  x,  and  the  two  sample 
procedure  may  very  well  be  acceptable  in  practice. 
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A  simplified  two  sample  procedure  can  be  constructed  in  the  following 

way.  Take  n.  observations  and  compute  Z  .  Divide  (0,<*>)  into  m  regions, 

nl  ^ 

R,,...,R  .  If  Z  e  R.,  let  N  ■  n.  (1  <  1  <  m),  and  take  n  ■  N-n,  addi- 
1 '  *  m  n^j*  j  v  "  1 

tional  observations.  (It  may  be  desirable  to  take  n^  >  n^  for  small  m). 

We  assume  that  the  n^  are  chosen  as  fn^/Yj]  where  y^  ■  l,...,ym  ■  (n1/n*)(>0) 
are  fixed  constants.  This  makes  nm  ■  fn*] .  The  regions  R^  will  be  chosen  as 
follows:  Rj  is  the  interval  (oCj//2nj ,  oC|//2n^],  R^  (2  <_  j  <_  m)  is  the 
pair  of  intervals  [oCj//2n^,  oC’_j//2 n^ )  U  (oC^  2rv j",  oC^//2nj]  where 

(4.2)  C‘  -  ,  C*  -  (1  <  i  im) 

and  the  values  R*  <  6,  <  •••  6  ,  <  6  ■  1  (x’(l)  «  0,  x+(l)  ■  ®)  are  given. 

—  l  —  m-i  m 

This  choice  of  R^  is  such  that  if  n  is  the  solution  of 

r  «  (o2/n)  (1  ♦  2F(Z^  fiK/o)),  then  n  <6^*  for  (oC"//2nj*)  <  Z„  <  (ocj/^n]'). 
A  conservative  choice  of  sample  sizes  n^  would  be  given  by  n^  >  A  or 
(Ym/Yj )  >  (1  <_  j  <  m).  When  m  »  fn*]  -  fn0]  and  n.  »  fn^  ♦  (i-1), 

6i  "  thcn  th*s  *s  equivalent  to  the  previous  two  sample  procedure. 

Define 

(4.3)  *  nj/N  . 

As  n*  -*■  «°  (and  n,  »  since  y  >0),  the  range  of  possible  values  for 

l  m 

2  2 
Yn  converges  to  (Yj»...,Ym)*  Clearly,  as  n*  increases,  yn  converges  in 

2 

law  to  a  random  variable  y  whose  distribution  is 

(4.4)  P(Y*  ■  Yj)  ■  [♦(x+B)  ♦  ♦(x-8)j|I  1  13  in 
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where  F  (x ) | ,  denotes 

j 


(4.5) 


and  where 


F(CJ)  -  F(C j)  j  -  1 

F(Cj)  -  F(C*  j)  ♦  F(Cj_j)  -  F(C")  j  -  2, . . .  ,m 


(4.6) 


6  ■  /2n^  u/o  . 


Using  (4.4),  it  is  easy  to  see  that 


(4.7)  lim(l/n*)E(N)  -  Y  lim  E(l/Y2)  *  Y  E(l/y2), 

*  'm  n  m  no 

n*-H»  n*-*«> 


and  using  (4.4)  and  (3.27)  that 


(4.8)  lim  lE(ZQN-a)2  -  (1/2)  lim  E(n«/N)  -  (1/2  y  )  lim  E(Y2)  -  (1/2  y  )E(Y2)  . 
n*-«®  r  n*-*»  n 


The  other  expressions  corresponding  to  (3.35)  and  '3.36)  simplify  in  a 


similar  way  to  give 


(4.9)  lim^/iE(X*-9*)  -  (l/^)[Ig  -  ST  I?) 


(4.10)  lim  pE(ZN-u))2  -  (1/2  Y_)[E(y!)  ♦  Ifi  ♦  431  ] 

n*-*» 


(4.11)  lim  iE(X*-0*)2  «  (1/2  Y_)[2E(y2)  ♦  I,  ♦  481,] 

n*-*» 


m  2 

8  -  [  Yv((2-8)«>(r-8)  ♦  (r+e J4>(z**-6) ]  |T 

°  k-1  K  xk 


where 
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i7  -  I  (Yklt(z+ti)*((i-exk)//^)  -  ♦(z-6)»(-(i*0xk)//x^i| 
k*  1  ^  % 


-2/y~  (Ke/Zy^)*  — — 


♦  0  /  [4»(y*0)*((y-0Xk)/A^)  ♦  *(y-0W-(yexj//x7)]dy) 


k'""kJ 


111 

lo  ■  [  Ykl*(z*B)  -  4(z-0)]  I 

8  k-1  K  h 


\  "  (1  -  Yk)/vk 


/  f(x)dx  -  J1  f(x)dx  if  k  -  1 


*  K 

■  J  f(x)dx  ♦  f  f(x)dx  if  k  ■  2,...,m 

r-  r  + 


In  the  particularly  simple  case  in  which  m  =  2,  we  denote  Ym  by  A, 
and  Cj  by  C  and  Cj  by  d,  the  formulas  (4.7)  through  (4.1i)  reduce  to 


(4.12a) 


(4.12b) 


(4.12c) 


lim(l/n*)E(N)  ■  1  -  I. 


lim-yiE(X*-0*)  -  (1//A)[I12  -  ft  In] 


lim  lE(Z  N-a)2  -  (1/2A) [A  ♦  I  ] 
n*-*» 


lira  jE(Z  -a.)2  -  (1/2A)(A  ♦  I  ♦  21  ♦  401  ) 

n*-K» 


(4.1 2d) 
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(4.1 2c )  lira  lE(X*-0*)2  -  (1/A)  (A  ♦  I1(J  ♦  Ig  ♦  261^] 

where, 

I9  -  ((1  -A2)/2)  ( (c>0)4>(c^0)  -  (d*B)$(d*B)  ♦  (c-BH(c-B)  -  (d-B)4(d-8)] 

^0  ■  0-A)l*W“8)  -  ♦  (c-e)  ♦  <X> (-c-B)  -  ♦  (-d-B) 

In  -  6{®(-B/ /k)  -  [(  /  ♦  /)»[:-^-6^]»(y-B)dy]} 

-d  c  l  /x  ' 

-SK  ♦(B/^){1  -  2[*f— ] 1  -  « f — £— ]  1 } 

'■/Ta'  ’-/TP 

-AU(d*8)*((d-BA)/A)  ♦  $(c-6)*(-(c*3A)//T) 

-♦(c+B)*((c-3A)/A)  -  ♦(d-8)*(-Cd*BA)//T)} 

♦  U(d*B)  -  «Hc+B)]  ♦  8(4(d*8)  -  4(c*B)] 

I12  -  ( (1  -A)/ /!)  [$(d+B)  ♦  <Kc-B)  -  ♦(d-B)  -  *(c«-B)] 

A  *  (1-A)/A  . 


Numerical  evaluations  of  (4.12)  were  made  for  a  few  combinations  of  A 
and  6  (•  fij)  (on  which  is  based  through  (4.2)),  namely: 


A\ 

6 

0.8S 

0.85 

0.90 

0.85 

0.90 

0.95 

0.85 

0.90 

0.95 
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The  values  of  x'(6)  and  x*(6)  needed  for  constructing  the  procedures  are 

t 

0. 

0. 

0. 

Tables  3  and  4  show  the  values  of  N  (given  (4.12a))  and  M  (given  by 
(4.12e))  respectively  for  the  various  combinations  of  A  and  6  shown  above. 

For  reference,  N  and  Ff  for  the  "regular"  two  sample  procedure  have  been 
transferred  from  Table  1.  These  computations  suggest  that  the  simplified  two 
sample  procedures  sacrifice  too  much  of  the  information  in  the  first  sample 
resulting  in  very  poor  M.S.E.  characteristics  with  little  compensating  saving 
in  sample  size.  Generally,  they  seem  inferior  even  to  the  conservative  proce¬ 
dure,  and  do  not  pick  up  any  of  the  good  features  of  the  regular  two  sample 
estimator. 

5.  ALTERNATE  LOSS  FUNCTION 

Another  possible  approach  to  constructing  a  sequential  procedure  for  the 

problem  of  Section  2,  is  to  let  the  loss  function  for  an  estimate  based  on  n 

2 

observations  be  cn  +  (X*-0*)  where  c  represents  a  relative  cost  per  observa- 
tion.  The  risk  is  then  cn  ♦  (A*oVn)H(Xj, . . .  ,xjc_1)  (x^  ■  A^/n/o,  1  ^  1  <_  k-1). 

The  optimal  sample  size  is  obtained  by  differentiating  the  risk  with  respect  to 
n  and  is  the  solution  of 

(5.1)  c  «  (A*o2/2n2)G (Xj , . . . ,  x^ ) 

where  G(Xj, . . .,x^_j)  is  given  by  (A.  11).  Define 
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Table  3 


Comparison  of 

N  for 

Alternative 

2-sample  Procedures 

A:  .85 

.90 

.90 

.95 

.95 

.95 

X 

Regular 

6:  .85 

.85 

.90 

.85 

.90 

.95 

0 

.8662 

.9221 

.9481 

.9278 

.9*40 

.9639 

.  9562 

.05 

.8662 

.9222 

.9481 

.9278 

.9741 

.9639 

.9562 

.10 

.8664 

.9223 

.9482 

.9279 

.9741 

.9640 

.9563 

.20 

.8671 

.9229 

.9486 

.9284 

.9743 

.9642 

.9565 

.30 

.8683 

.9240 

.9494 

.9292 

.9747 

.9646 

.9568 

.40 

.8702 

.9255 

.9504 

.9304 

.9753 

.9653 

.9573 

.50 

.8725 

.9274 

.9518 

.9319 

.9760 

.9660 

.9579 

.75 

.8802 

.9337 

.9562 

.9369 

.9783 

.9687 

.9601 

1.00 

.8897 

.9412 

.9615 

.9431 

.9811 

.9719 

.9630 

1.25 

.9004 

.9492 

.9670 

.9499 

.9839 

.9755 

.9665 

1.50 

.9118 

.9573 

.9726 

.9571 

.9868 

.9792 

.9703 

1.75 

.9237 

.9653 

.9780 

.9644 

.9896 

.9830 

.9744 

2.00 

.9357 

.9728 

.9831 

.9715 

.9921 

.9866 

.9787 

3.00 

.9764 

.9935 

.9963 

.9926 

.9984 

.9968 

.9934 

4.00 

.9949 

.9993 

.9997 

.9991 

.9999 

.9997 

.9991 

Table  4 

Comparison  of  M  for  Alternative  2-sample  Procedures 


X 

Regular 

A:  .85 

6:  .85 

.90 

.85 

.90 

.90 

.95 

.85 

.95 

.90 

.95 

.95 

0 

1.1028 

1.0473 

1.0290 

1.6493 

1.6134 

1.0229 

1.0346 

.05 

1.0667 

1.0353 

1.0140 

1.0275 

.9921 

.9965 

1.0041 

.10 

1.0337 

1.0305 

1.0062 

1.0138 

.9779 

.9780 

.9822 

.20 

.9786 

1.0427 

1.0129 

1.0114 

.9721 

.9666 

.9654 

.30 

.9383 

1.0829 

1.0484 

1.0424 

.9964 

.9899 

.9846 

.40 

.9121 

1.1466 

1.1087 

1.1031 

1.0476 

1.0449 

1.0364 

.50 

.8980 

1.2274 

1.1878 

1.1875 

1.1200 

1.1260 

1.1150 

.75 

.8997 

1.4613 

1.4235 

1.4572 

1.3506 

1.3975 

1.3848 

1.00 

.9279 

1.6818 

1.6507 

1.7416 

1.5837 

1.6928 

1.6939 

1.25 

.9629 

1.8539 

1.8290 

1.9923 

1.7714 

1.9554 

1.9956 

1.50 

.9963 

1.9671 

1.9447 

2.1870 

1.8945 

2.1578 

2.2664 

1.75 

1.0245 

2.0178 

1.9927 

2.3119 

1.9456 

2.2832 

2.4867 

2.00 

1.0455 

2.0058 

1.9733 

2.3566 

1.9248 

2.3204 

2.6345 

3.00 

1.0570 

1.5255 

1.4674 

1.8266 

1.4125 

1.7461 

2.2809 

4.00 

1.0224 

1.1047 

1.0802 

1.1843 

1.0611 

1.1447 

1.3598 
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(5.2)  v2  -  (n2c/o2) , 
so  that 

(5.3)  ■  A^/v/i/o  . 


A  sequential  stopping  rule  which  is  natural  to  use  when  the  a^  are  not  known, 
is  stop  the  first  time  that 

(5.4)  c  >  (A*o2/2n2)G(i1,...,xkl) 

where 


*i  "  1  I  1  I  k-1  * 

2 

If  one  studies  the  behavior  of  such  a  rule  for  fixed  o  as  c  +  0,  it  is 

seen  from  comparing  (5.1),  (5.2)  and  (5.3)  with  (2.3),  (2.4)  and  (2.6)  that  the 

behavior  is  almost  identical,  with  /c  replacing  r  and  G  in  place  of  H. 

It  might  have  been  suspected  that  the  difficulties  pointed  up  by  Theorems  1  and 

2  were  due  to  the  choice  of  loss  function  and  criterion  for  choosing  the  sample 

size.  It  might  then  have  been  hoped  that  using  this  loss  function  would  lead 

to  probability  one  convergence  to  unity  (as  c  decreases)  of  the  ratio  of  the 

random  sample  size  N  to  the  optimal  sample  size  (computed  from  (5.1)  with 

fixed  A^) .  Instead,  the  analogues  of  Theorems  1  and  2  hold  as  c  -*•  0  (in 

the  analogue  of  Theorem  1,  the  sample  size  behaves  as  though  all  A^  *  ♦», 

which  from  (A. 12)  implies  that  lim(N/E/o)  ■  }  a.s.).  Suppose  c  is  fixed 

c-*0 

and  a  •*  •,  and  it  is  assumed  that  ((^n/o)A.  ,  1  <_  i  <  k)  have  the  joint 
distribution  of  { (Y#-Y^),  1  <  i  <_  k)  where  the  Y's  are  normal,  variance  1 
and  means  (-/n/o)A^).  For  the  present  sampling  rule  it  is  seen  that  (i^n/o) 
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is  proportional  to  (1 //o)  so  that  the  means  of  the  are  in  the  limit 
(o  ■*  •)  zero,  regardless  of  the  original  A^.  Thus  as  a  «  for  fixed 

N  has  a  limiting  distribution  (not  an  almost  sure  limit)  which  is  inde¬ 
pendent  of  the  A^.  A  result  like  Theorem  2  will  obtain  if  the  A^  are 
proportional  to  /a. 

If  the  global  maximum  of  G  is  at  the  origin  (see  (A.  15)  and  the  dis¬ 
cussion  following  it),  then  the  conservative  sampling  procedure  takes 
n  -  (see  (A.  13)). 

In  the  case  k  ■  2,  the  shape  of  G  is  very  similar  to  that  of  H  and 
all  of  the  results  of  section  3  could  be  transcribed  easily  for  this  loss  func¬ 
tion. 

6.  A  RELATED  PROBLB1 

2 

Let  Xj,...,Xn  be  normal,  mean  w,  variance  T  ,  and  suppose  that  it 
is  desired  to  estimate  |u|  by  means  of  the  estimator  »  (jTj.  The  risk 
of  this  estimator  is 

(6.1)  R(Z,u)  •  (TZ/n)(l  ♦  4F (u/n/ t ) ] 

where  F(x)  is  given  by  (3.5).  The  behavior  of  this  risk  function  is  the  same 
as  of  (3.4)  so  that  sequential  and  multiple  sample  procedures  based  on  substi¬ 
tuting  Zr  for  <d  in  (6.1)  will  have  the  same  properties  as  described  in 
Sections  2  and  3.  The  bias,  expected  Sample  size,  and  risk  functions  for  the 
two  sample  procedure  will  be  very  similar  to  those  tabulated  for  the  case 
k  *  2,  but  not  identical  since  for  (6.1),  R*  ■  0.596,  and  from  equations 
(3.36),  it  is  seen  that  R*  enters  these  expressions  in  a  non-linear  way. 
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APPENDIX 


(A.  1.1 


wher* 

(A. 2) 

Lemma 

taken 

x*  <  x 
1 

Proof : 

(A.  3) 


PROPERTIES  OF  THE  RISK  FUNCTION 


The  risk  function  R(X*;  0j,...,8^)  can  be  expressed  as 


oo  k 

E(x*-e*)2  ■  /  (x-e*)2d  n  «(^iT(x-ei)/o) 


i*l 


/yd  n  ♦O^n"  (y 

-oq  i«l 


*  2  k 


«  /  y  d[  n  * (^(y+A. )/o)  -  n  4>(/n~(-y-*-A,)/o)] 

0  i»l  1  i-1  1 

®  k  k 

-2  /  y [  1  -  n  H*r  (y+A.)/o)  ♦  n  4>(»^r  (-y+A.)/o)]dy 


i-1 


i»l 

k  k 

(oz/n)2  /  y[l  -  n  *(y+x.)  ♦  n  <H-y+x.)]dy 

A  J  _  1  1  ^ 


0 

2 


i«l 


i=l 


*  (A*o  /n)H(xlt...,xk  l) 


*i  *  6*  -  0i,  Xj  -  (t^T A^/o)  . 


A.  1 .  For  fixed  values  of  (x^, •••*Xj_j,x.+1,..  •»xk_1)  H(Xj, . . .  ,xkl) 
as  a  function  of  x^  decreases  for  0  1  x^  <  x|  then  increases  for 


Writing 


2 

,  9H(x  ,  ...,x.)  -x,/2  ®  x.y  -x.y 

(r*)  - aT - ■  2e  /  y$(y)[e  1  n  4>(-y+x.)-e  n 

A  3xi  0  )*i  j  j^i 


♦(y*xj)]dy 


it  is  seen  that  at  x^  ■  0,  the  bracketed  term  in  the  integral  is  negative  for 

all  y  >  0  so  that  the  derivative  is  also.  Further,  a  lower  bound  on  the 

k-1 

bracketed  term  is  [ (1  ♦  x^y)$  (-y ) - 1 )  so  that  the  integral  in  (A. 3)  becomes 

infinite  as  xi  increases.  Taking  the  derivative  of  the  integral  in  (A. 3)  with 
respect  to  x.  gives 


i  / 2  3H(X. ....  ,  X.) 

<7*  ~  -V-l 


®  2  x  y  -  x.y 

/  y  ny)[e  1  n  $(-y*x.)  ♦  e  1  n  *(y*x.)]dy>0 
0  j  j*i  3  jti  3 


so  that  the  derivative  has  only  one  sign  change,  completing  the  proof. 

Two  of  the  more  important  properties  of  the  risk  function  for  k  •  2  will 
now  be  demonstrated. 


Lemma  A. 2. 

(A. 4)  (X*;  Qv62)  <  0  . 


Proof:  From  (3.4),  (3.5),  and  (3.6)  it  is  seen  that 

(A. 5)  -(o2/n2)(l  ♦  2F(x)  -  2nf (x) (x/2n)J 


where  f(x)  =  dF(x)/dx,  and  it  is  easily  checked  that  (A. 5)  becomes 

—  =  -(o2)/n2) (1  -  x*(x)) 


completing  the  proof  since  x<j>(x)  <  $(1)  <  1  for  x  >  0. 
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In  obtaining  the  distributions  in  section  3,  use  was  made  of  the  equivalence 
between  the  statements 


1  7  7 

(n  <  6}  where  n  satisfies:  n  ■  [1  ♦  2F.(n<jo/2/r)] 


and  (u>*  <  ai  <  m*},  where  w.  ■  xj SE,  and  x.  is  one  of  the  two  solutions  of 

0  o  o  6  6 


(A.  6) 


6  •  1  ♦  2F(x)  . 


This  equivalence  depends  on  the  fact  that  u>g(u>5)  is  a  monotone  increasing 
(decreasing)  function  of  5  (0  <  6  <  R*).  The  fact  that  x~(x*)  is  a  monotone 

increasing  (decreasing)  function  of  6  follows  immediately  from  the  fact  that 
(1  ♦  2F(x))  decreases  monotonely  for  x  <  x*  and  increases  monotonely  for 
x  >  x*  (see  Lemma  A.l).  We  now  show  that  behaves  in  the  same  way. 

Lemma  A. 3.  Let  oi~  and  u*  be  the  two  solutions  of 
-  6  6 


(A.  7) 


6  •  1  ♦  2F(u/267r)  . 


has  the  same  sign  as  — r; — ,  and  is  zero  only  when  the  latter  is 

0  0 


zero. 


Proof :  Writing  u>.  as  c  [xj/E),  it  is  seen  that 


(A.  8) 


r  A 

3 t  *  j72(-x!  * 26  ar5 


From  (A. 6)  it  is  seen  that 


1  -  2f<V;nr 


(A.  9) 
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Thus  using  (A. 9)  in  (A. 8)  along  with  (A. 6)  it  is  seen  that 

3to.  3x 

(A-10>  iTVi/l  -  '-'l-WI  *  '  T77«  ’ 

S  (Zt(XfiJJ  6 

and  since  (1  -  xij»(x))  >  0,  this  completes  the  proof. 

It  is  difficult  to  characterize  H(Xj, . . . ,x^_j)  in  great  detail,  but  the 
following  remarks  indicate  some  of  the  known  properties. 

Remarks : 

1)  By  direct  application  of  the  proof  of  Theorem  1  of  Saxena  and  Tong  [3],  it 
is  easy  to  see  that  for  fixed  (Xj,...,x  )  (say) 

max  M(*ji  •  •  "  ^(Xj , . .  •  i Xp, 0, . . . ,  0)  . 

2)  The  unimodal  shape  of  H(x^)  «  H(Xj , . . .  ,x^)  in  x^  (fixed  values  of  (k-2) 

other  x's)  implies  monotonicity  of  the  solutions  x^(6),  x^(6)  to  the  equa¬ 

tion  6  »  H(x^)  (for  6  >  1/A*,  there  will  be  only  one  solution,  x^(6)).  As 
in  the  case  k»  2,  it  would  be  desirable  to  be  able  to  show  monotonicity  of 
(x'(+)(6)//6),  or  alternatively  to  show  that  x^/4T(x^)  is  monotone  increasing 
in  x^  i.e.,  to  show  that  (2H(x^)  -  x^ (dH(x^)/dx^)]  >  0.  This  is  trivial 
when  H(x^)  is  decreasing,  but  illusive  in  general.  As  x^  becomes  very 
large,  H(x.,)  has  a  positive  limit,  while  x^(dH(x^)/dx^)  goes  to  zero.  Thus 
the  difficulty  encountered  is  for  x^  >  x£,  but  not  too  large. 

3)  Similarly,  it  would  be  desirable  to  show  that  for  general  k,  (dR(X*; 

91* *  * .»ek)/dn)  <  °*  Th*s  *s  equivalent  to  showing  that  G(Xj, . . .  .x^^)  >  0 
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for  all  (Xj,...,X|^  j)  where 


(A,  11)  G  (X  j  ,  a  •  .  ,  Xj^  ^j) 


k-1 

[2H(x. , . . .  ,x.  )  -  l  x.  (3H(x. 

l  K  i.\  1  1 


•  •  *Xj^)/ )  ]  • 


The  difficulties  are  of  the  same  nature  as  mentioned  above. 


4)  It  is  not  clear  whether  the  function  H(Xj, . . .  .x^  j)  has  a  unique  minimum. 
The  system  of  equations  3H(Xj, . . .  ,x^)/3Xj  "0  (1  <_  i  <_  k-1)  has  a  solution 
on  the  diagonal  x^  ■  x^  »  • • •  ■  x^  j  ■  x,  given  by 


/  y<'(y)[eXy$k'1(-y+x)-e‘xy«k"1(y+x)ldy  -  0  . 
0 


Whether  this  solution  is  unique  and  whether  there  are  solutions  off 
is  not  clear.  If  a  solution  off  the  diagonal  exists,  then  from  the 
the  function,  any  permutation  of  that  solution  is  also  a  solution. 


siders  a  ray  Xj  *  tr^  (r^  ^  0, 

fixed, 

0  <_  t  <  ®,  1 

<  i  <  1 

3H(trj, ... ,tr^_j,0) 

k-1 

*  l  ri 
i*l  1 

3H(Xj ,..., x^) 

3t 

3x. 

l 

xi-triJ 

the  diagonal 
symmetry  of 
If  one  con- 


It  is  clear  that  this  derivative  is  initially  negative  since  each  term  is,  and 

that  there  is  a  t*  (depending  on  the  r's)  such  that  for  t  >  t*,  it  is 

positive  but  since  the  individual  terms  change  signs  at  different  values  of  t 

and  since  the  terms  are  not  necessarily  monotone  increasing  (because  of  the 
-(tr  )2/2 

e  factor)  it  is  not  clear  whether  there  is  only  one  sign  change. 

There  is  a  possibility  of  the  function  oscillating  in  a  limited  t  range. 
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5)  One  property  which  can  be  established  is  that  R*  is  a  decreasing  function 
of  k,  with  a  lower  bound  which  approaches  (1/2A*)  as  k  increases.  To  see 
this,  let  xj, . .  .,xj|_j  be  such  that 


(xj ,  • .  • ,  xjj ^  j )  ■  min  H (x j , . . . ,  x^^  j )  ■  Rj|  • 

V’-’Vl 


Then 


R*+1  «  min  H(x1# . . . ,xk)  <  min  H(x|, . . . »x*_1,xk) 

Xk 


xl*  *  *  * ,xk 


<  H(xJ, . . .  fX£_j  ,“)  *  H(x*,...,xj|_1)  -  R* 


From  Theorem  4.2.5  of  Dudewicz  [2],  it  is  easily  seen  that 


(1/4*) ( (1/2)  ♦  /  x2d*k(x)J  <  R*  <  (1/A*) 


and  the  lower  bound  decreases  to  (1/2A*)  as  k  increases.  A  somewhat  sharper 
upper  bound  for  R£  is 


(1/A*)  inf  /  y2d$(y)<J>k  *(y+x)  . 

0<x<»  -• 

The  function  G(Xj, .. . ,x^_j)  of  (A. 11)  is  of  interest  in  section  5, 
and  a  few  of  its  properties  will  be  described  here.  Clearly,  since 
(3H(Xj , . . .  jX^  j)/9x^)  goes  exponentially  fast  to  zero  as  x^  increases, 

lim 

min(x1,...,xkl)-*» 


(A. 12) 


G(x1,...,x]c_1)  ■  2H(®,  ...,<*>)  «  (2/A*) 


I 


42 


and 


(A. 13) 


G (0, . . . ,0)  -  2H(0, . . .  ,0)  -  2  >  (2/A*) 


Also 


3G(x. , . . .  ,x.  •)  3H(x. , . . .  ,x,  -)  k-1  3  H(x. ,  •  •  ■  (x,  .) 

<*■“>  -  -  y,-  rJ  • . .  5x7  •  I  xi  tLi, 

j  j  i»l  i  j 


so  that 


(A. 15) 


3G (Xj , . . 4 , x^_j ) 


3x. 


3H(Xj,  •  • .  ,X|(_ j ) 


3Xj 


(0 , . . . ,  0  ) 


(0.....0) 


Thus  GCXj,...,^  p  decreases  from  its  value  at  the  origin  which  in  turn  is 
higher  than  the  value  at  infinity.  Whether  the  value  at  the  origin  is  a  uni¬ 
versal  maximum,  and  details  on  the  shape  of  the  function  are  not  known.  When 
k  =  2,  G(x)  *>  2(1  -  x$(x))  decreases  monotonely  for  0  <  x  <  1,  then 
increases  for  x  >  1  with  a  minimum  value  of  2(1  -♦(!))  ^  2(0.76). 


